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Abstract 

Genetic factors play very important roles in the onset and progression of type 2 diabetes mellitus {J2DM}. However, the 
genetic factors correlating with T2DI\/1 onset have not as yet been fully clarified. We previously found that copy number 
losses in the subtelomeric region on chromosome 4p16.3 were detected in early-onset Japanese T2DI\/1 patients (onset age 
<35 years) at a high frequency. Herein, we additionally found two novel copy number losses within the subtelomeric 
regions on chromosomes 16q24.2-3 and 22ql3.31-33, which have significant associations with early-onset Japanese T2DM. 
The associations were statistically significant by Fisher's exact tests with P values of 5.19x10"^ and 1.81 xlO~^ and odds 
ratios of 5.7 and 4.4 for 16q24.2-3 and 22ql3.31-33, respectively. Furthermore, copy number variation (CNV) analysis of the 
whole genome using the CNV BeadChip system verified simultaneous copy number losses in all three subtelomeric regions 
in 11 of our 100 T2DIV1 subjects, while none of 100 non-diabetic controls showed the copy number losses in all three 
regions. Our results suggest that the mechanism underlying induction of CNVs is involved in the pathogenesis of early- 
onset 12DM. Thus, copy number losses within multiple subtelomeric regions are strongly associated with early-onset T2DM 
and examination of simultaneous CNVs in these three regions may lead to the development of an accurate and selective 
procedure for detecting genetic susceptibility to T2DM. 
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Introduction 

Numbers of patients with type 2 diabetes mellitus (T2DM) have 
been increasing annually worldwide. Environmental factors, such 
as aging, westernized high-fat diets, lack of exercise and everyday 
stress affect the onset and progression of T2DM. On the other 
hand, according to epidemiological studies, genetic factors also 
play a major role [1]. For example, concordance rates and 
heritabUity for T2DM are higher in identical than in fraternal 
twins [2] . When both parents have T2DM, the risk of this disease 
in their children is three to four times higher than that of the 
offspring of non-diabetics [3]. Furthermore, the sibling recurrence- 
risk ratios for T2DM were substantially elevated in families with 
diabetes in both a parent and a grandparent [4]. Therefore, many 
investigators worldwide have been intensively searching for genetic 
factors associated with susceptibility to T2DM. 

Loci for rare monogenic forms of familial diabetes, such as 
maturity-onset diabetes of the young (MODY) [5], including 
glucokinase gene mutations [6], mitochondrial diabetes [7,8], and 
Wolfram syndrome [9], have been demonstrated in small 
proportions of patients. However, the genetic etiology of familial 
T2DM has not been revealed in the majority of cases. To identify 



genetic factors conferring susceptibility to T2DM, genome-wide 
association studies (GWASs) using single nucleotide polymorphism 
(SNP) markers have been performed. These GWASs and 
replication studies have identified multiple susceptibility loci, such 
as TCF7L2 [10], GLIS3 [11], KCMK16 [11,12], MAEA [12] and 
KCMQJ [13,14]. However, their odds ratios are consistently in the 
1.1-1.4 range [9,13], suggesting their contributions to be relatively 
low. Furthermore, among the loci detected to date, the initially- 
reported associations have not necessarily been replicated in 
subsequent studies [15,16]. 

Other chromosomal mutations, which cover wider genomic 
regions, include duplication, deletion, insertion, translocation, 
block substation, indel (insertion-deletion) and copy number 
variations (CNVs). Among these genetic variations, CNVs are 
increasingly being recognized due to their major contributions to a 
number of diseases, including certain malignancies. To date, the 
existence of CNV was considered to be very rare, since CNV 
would have a fatal impact on brain function and/ or the survival of 
individuals [17]. However, recent advances in human genome 
studies have totally revised this concept. Many case reports have 
made it clear that CNVs exist even in healthy individuals [15,16]. 
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Table 1. Characteristics of T2DM patients and controls. 









Cases (n = 100) 


Controls (n = 100) 


P 


Male 


59/1 00 


35/100 


0.0011 


Age of onset 


21.9±7.3 






Family history of T2DM 


94/1 00 


0/100 


0.0001 


Age of examination 


37.1 ±13.0 


72.8±6.4 


0.0001 


Maximum body mass index (l<g/m^) 


28.0 ±4.2 


25.2±3.2 


0.64 


HbAlc (NGSP)(%) 


8.4±2.2 


5.5±0.3 


0.0008 


Postprandial plasma glucose (mg/dl) 


171.4±79.7 


99.4±15.6 


0.0006 





doi:l 0.1 371 /journal.pone.0088602.t001 



For example, a CNV in the CCR3L1 region is reportedly related 
to HIV infection [18] .A CNV in the FCGR3 gene was reported to 
correlate with glomerulonephritis in systemic lupus erythematosus 
patients [19,20]. Therefore, CNVs are recognized as significant 
genetic contributors to human diseases [21]. Furthermore, in 
common diseases, such as schizophrenia [22,23], autism [24-27] 
and bipolar disorder [28], CNVs have been demonstrated to be 
contributory genetic factors. However, there are only a few reports 
demonstrating associations between CNV and T2DM [29-32]. 
Recently, we demonstrated copy number losses within the 1.3-Mb 
subtelomeric region on chromosome 4pl6.3 in a substantial 
portion of early-onset Japanese T2DM subjects [32]. 

In our search for other CNVs conferring genetic susceptibility to 
T2DM, we performed genome-wide CNV analyses in early-onset 
Japanese T2DM subjects. Early-onset patients had developed 
T2DM before environmental and acquired factors would have 
had major impacts, suggesting these patients to have stronger 
inherited factors. Therefore, we enrolled T2DM subjects whose 
onsets had been before 35 years of age. As a result, 94 of our 100 
T2DM subjects had a family history of T2DM within third-degree 
relatives [32] . We first screened CNVs in the whole genome using 
the deCODE-Elumina CNV370K BeadChip system which 
focuses on the CNV-rich region of the human genome, followed 
by validation and characterization using an Agilent region- 
targeted high-density custom-made oligonucleotide tiling micro- 
array [32]. We found two novel copy number losses within the 
multiple subtelomeric regions on chromosomes 16q24.2-3 and 
22ql3. 31-33, which have associations with early-onset diabetes. In 
addition, importandy, copy number losses in chromosomes 
4pl6.3, 16q24.2-3 and 22ql3. 31-33 were often detected simulta- 
neously in the same subjects. In particular, all three copy number 
losses were verified in 1 1 of our 1 00 T2DM patients, while none of 
the non-diabetic controls showed simultaneous copy number losses 
in these regions. Our results suggest that the combination of CNVs 
in these three genome regions is a potential DNA marker for 
T2DM, and that the mechanism underlying induction of CNVs, 
rather than a single gene in these CNV regions, is involved in the 
pathogenesis of T2DM. 

Materials and Methods 

Study population 

We considered early onset of T2DM to reflect the presence 
more of genetic than of environmental factors. Therefore, we 
enrolled diabetic patients with early-onset disease as study subjects. 
One hundred unrelated Japanese T2DM patients, who had 
developed T2DM before 35 years of age and whose maximum 
body mass index (BMI) was less than 35 (kg/m ), were the same 



subjects as in our previous study [32]. They were recruited at 
Tohoku University Hospital and affiliated hospitals and medical 
clinics. Diabetes had been diagnosed by applying the WHO 
criteria. Type 1 diabetes mellitus was excluded based on chiiical 
features and the absence of anti-GAD (glutamic acid decarbox- 
ylase) antibodies or anti-IA-2 (insulinoma-associated antigen-2) 
antibodies. Patients with diabetes meUitus due to hepatic disease, 
pancreatic disease, other endocrine disorders or mitochondrial 
DNA mutations, and those with drug-induced diabetes, based on 
laboratory data and clinical history, were excluded. We also 
studied 100 non-diabetic control subjects, again the same subjects 
as in our previous study [32] . These control subjects were enrolled 
using the following criteria: 60 or more years of age, no prior 
diagnosis of diabetes meUitus, HbAlc less than 6.4% and absence 
of a family history of T2DM within third-degree relatives. 
Applying these criteria allowed us to exclude subjects with a high 
likelUiood of later developing T2DM. HbAlc (%) was estimated as 
an NGSP (National Glycohemoglobin Standardization Program) 
equivalent value (%) calculated by the following formula: HbAlc 
(%) = HbAlc JDS: Japan Diabetes Society value) (%)+0.4%. The 
value of HbAlc JDS) (%) measured based on the previous 
Japanese standard substance and measurement methods. 

Genetic analysis of human subjects was approved by the ethics 
committee of Tohoku University Graduate School of Medicine. 
After a detailed description of study participation, written 
informed consent was obtained for each subject. 

Screening methods with the Whole-Genome CNV 
BeadChip System 

As previously reported [32], we screened the whole genome by 
CNV analysis using the deCODE-IUumina CNV370K BeadChip 
system (lUumina Infinium System, deCODE Genetics, Inc., 
Iceland), which, in addition to Hap300 SNP marker contents, 
has CNV probes designed to target the CNV-rich region of the 
whole genome. The CNV portion of the platform consists of 
probes covering CNV rich regions of the genome, such as 
megasateUites (tandem repeats >500 bp), duplicons (regions 
flanked by highly homologous segmental duplication > 1 kb), 
unSNPable regions (>15 kb gaps in HapMap SNP map, and 5— 
15 kb gaps with >2SNPs with Hardy-Weinberg failure), and 
CNVs registered in the Database of Genomic Variants. The CNV 
portion of the probe content consists of 15,559 CNV segments 
covering 190 Mb, or 6% of the human genome. The platform has 
been tested in 4000 Icelandic and HapMap samples. 

Data analysis of the deCODE-IUumina CNV chip was carried 
out using DosageMiner software developed by deCODE genetics, 
and loss/gain analysis consisted of the following four steps; (1) 
intensity normalization and GC content correction, (2) removal of 
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Figure 1. Genomic region harboring copy number loss of 1.75-Mb 16q24 (A) or 1.85-iVlb 22q13 (B) subtelomeric region in 15 early- 
onset T2DM patients. A) Data obtained employing the deCODE/lllumina CNV370K chip system were analyzed using the PennCNV program. 
Genome structures of the 15 patients and 3 normal subjects are aligned as horizontal bars from genome position 86,950,000 (left) to position 
88,700,000 (right). Dark solid horizontal bars represent the extent of copy number loss in each of the subjects. Gray regions between the dark bars 
represent intervals where copy number loss could not be inferred due to poor probe coverage. The lower map is an ideogram of chromosome 16. 
Position is given relative to NCBI Build 36 for chromosome 16. B) Data obtained employing the deCODE/lllumina CNV370K chip system were analyzed 
using the PennCNV program. Genome structures of the 22 patients and 6 normal subjects are aligned as horizontal bars from genome position 
44,750,000 (left) to position 49,550,000 (right). Dark solid horizontal bars represent the extent of copy number loss in each of the subjects. Gray 
regions between the dark bars represent intervals where copy number loss could not be inferred due to poor probe coverage. The lower map Is an 
ideogram of chromosome 22. Positions are given relative to NCBI Build 36 for chromosome 22. 
dol:1 0.1 371/journal.pone.0088602.g001 



batch effects using principal component analysis, (3) calling of 
clusters using a Gaussian mixture model, and (4) determination of 
CNV type using graphical constraints. In brief, CNVs were 
identified when CNV events stood out in the data, as all sample 
intensities for CNV probes should be increased or decreased 
relative to neighboring probes that are not in a CNV region. To 
determine deviations in signal intensity we started by normalizing 



the intensities. The normalized intensities for each color channel 
were determined by an equation and fit formula developed by 
deCODE genetics. A stretch with occurrence of more than one 
marker showing an abnormality in the copy number in a 
consecutive stretch in the genome is considered more likely to 
be evidence of deletion or gain [22] . 
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microarray analysis based on an array comparative genomic 
hybridization (aCGH) assay as previously reported. We fabricated 
a custom-designed microarray targeted to a 1.75-Mb genome 
region in the subtelomere at 16q24.2-3 (Chr. 16: 86,950,000- 
88,700,000 (NCBI Build 36.1, hgl8)), and a 4.8-Mb genome 
region in the subtelomere at 22ql3. 31-33 (Chr. 22: 44,750,000- 

49.550.000 (NCBI Build 36.1, hgl8)) according to previously 
described methods [33,34] . In short, we used the Agilent website 
(http://earray.chem.agilent.com/earray/) to select and design our 
custom tiling array; the array consisted of probes 60-mer in size 
(Agilent Technologies, Santa Clara, CA, USA). 

TUing-aCGH experiments were performed essentially as 
described previously [35]. In brief, test and reference (NA 19000, 
a Japanese male from the HapMap project) genomic DNAs 
(250 ng per sample) were fluorescently labeled with Cy5 (test) and 
Cy3 (reference) with a ULS Labeling Kit (Agilent Technologies). 

For each sample, respective labeling reactions were mixed and 
then separated prior to hybridization for each of the arrays. 
Labeled test and reference DNAs were combined, denatured, pre- 
annealed with Cot-1 DNA (Invitrogen, 1600 Faraday Avenue, 
Carlsbad, CA, USA) and a blocking agent, and then hybridized to 
the arrays for 24 hours in a rotating oven at 65°C and 20 rpm 
(Agilent Technologies). After hybridization and washing, the 
arrays were scanned at 3 |im resolution with an Agilent G2505C 
scanner. Images were analyzed with Feature Extraction Software 

10.7.3.1 (Agilent Technologies), with the CGH 107 Sep09 
protocol for background subtraction and normalization. Abnormal 
copy numbers, losses, and gains, in a complex multi-copy variable 
region by high-density tiling array were detected by deviation of 
probe log2 ratios that exceeded a threshold of 1 SD from the 
median probe ratio, according to procedures described previously 
[35-37]. We defined two copy number classes, that is, "unchanged 
copy number" and "copy number loss." "Unchanged copy 
number" was defined as the log2 ratio staying within the mean± 
1 SD distribution for the normal population. "Copy number loss" 
was considered to be present when the downward-deviation of 
log2 ratios exceeded a threshold of 1 SD from the median probe 
ratio. 



Figure 2. Structural details of copy number loss of 1.75-Mb 
16q24 (A) or 1.85-Mb 22q13 (B) subtelomeric region resolved 
by high-density tiling microarray. A) The log2 ratio (y-axis) was 
plotted using the moving average along the genome position (x-axis). 
Representative early-onset T2DM data, from patients 1, 2, 3, and 4, 
respectively, are shown in Figures 2.1(a), 2.1(b), 2.1(c), and 2.1(d). The 
black vertical bar shows the copy number plot along the genome. The 
two pale lines indicate the normal range of average log2 ratios for 
probes among normal individuals. Copy number losses are displayed as 
red vertical bars. "Copy number loss" was considered to be present 
when the downward-deviation of log2 ratios exceeded a threshold of 
1SD from the median probe ratio. B) The log2 ratio (y-axis) was plotted 
using the moving average along the genome position (x-axis). 
Representative early-onset T2DiVl data from patients 1, 2, 3, and 4, 
respectively, are shown as Figures 2.2(a), 2.2(b), 2.2(c), and 2.2(d). The 
black vertical bar shows the copy number plot along the genome. The 
two pale lines indicate the normal range of average log2 ratios for 
probes in normal individuals. Copy number losses are displayed as red 
vertical bars. "Copy number loss" was considered to be present when 
the downward-deviation of log2 ratios exceeded a threshold of 1SD 
from the median probe ratio. 
doi:1 0.1 371 /journal.pone.0088602.g002 

High-Density Custom-Made Oligonucleotide Tiling 
Microarray Analysis 

DNA samples from early-onset T2DM subjects were subjected 
to Agilent's high-density custom-made oligonucleotide tiling 



Statistical analysis 

Statistical analyses were performed using JMP and Microsoft 
excel. The Chi square tests were used to compare characteristics of 
cases and controls, and the clinical characteristics of T2DM 
associated with combined copy number loss. Fisher's exact tests 
were used to examine whether there were differences in the copy 
number losses between cases and controls. 

Results 

In searching for other CNVs associated with early-onset T2DM, 
we screened the whole genome for CNVs using the deCODE- 
lUumina CNV370K BeadChip system in the same 100 early-onset 
Japanese T2DM subjects and 100 non-diabetic controls that we 
reported previously [32]. Characteristics of the diabetic cases and 
non-diabetic controls are shown in Table 1. To search for other 
CNVs as genetic variations possibly conferring susceptibility to 
T2DM, we performed genome-wide CNV analyses in early-onset 
Japanese T2DM subjects and non-diabetic controls. Early-onset 
patients had developed T2DM before environmental factors 
would have had major impacts, suggesting these patients to have 
stronger genetic factors. Therefore, we selected 100 Japanese 
T2DM subjects whose onsets had been before 35 years of age. In 
fact, 94 of our 100 T2DM subjects had a family history of T2DM 
within third-degree relatives (Table 1). As non-diabetic controls. 
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Figure 3. Extents of copy number losses within tiie 1.6-iVlb 16q24 (A) and 4.8-Mb 22q13 (B) subtelomeric regions in early-onset 
T2DM patients as revealed by high-density oligonucleotide tiling microarray analysis. A) Darl< horizontal bars represent the extents of 
copy number loss in each region in patients, as revealed by the Agilent custom tiling array analysis. Genome structures of the 1 0 patients subjects are 
aligned as horizontal bars from genome position 87,000,000 (left) to position 88,600,000 (right). Gray regions represent proximal ends of stretches 
where copy number status could not be inferred due to the presence of multiple low copy repeats. The upper map is an ideogram of chromosome 1 6 
and the positions of putative genes in the 16q24 region described in the NCBI Map Viewer (http://www.ncbi.nlm.nih.gov/projects/mapview). 
Positions are given relative to NCBI Build 36 for chromosome 16. B) Dark horizontal bars represent the extent of copy number loss in each of the 
patients, as revealed by the Agilent custom tiling array analysis. Genome structures of the 12 patients subjects are aligned as horizontal bars from 
genome position 44,750,000 (left) to position 49,550,000 (right). Gray regions represent proximal ends of the stretch where copy number status could 
not be inferred due to the presence of multiple low copy repeats. The upper map is an ideogram of chromosome 22 and the positions of putative 
genes in the 22q13 region described in the NCBI Map Viewer (http://www.ncbi.nlm.nih.gov/projects/mapview). Positions are given relative to NCBI 
Build 36 for chromosome 22. 
doi:1 0.1 371/journal.pone.0088602.g003 
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genomes obtained from subjects who were at least 60 years of age 
and had not developed diabetes were used. In addition to the 
CNV in 4pl6.3 [32], interestingly, those of 16q24.2-3 and 
22ql3. 31-33 were found to be located on the subtelomeric regions 
in each chromosome. At the CNV on 16q24.2-3, 15 of the 100 
T2DM subjects displayed copy number loss around this CNV 
marker, compared with only 3 of the 100 non-diabetic control 
samples. At the CNV on 22ql3. 31-33, 22 of the 100 T2DM 
subjects displayed copy number loss around this CNV marker, 
compared with only 6 of the 100 control samples. Hereafter, we 
focused on the two subtelomeric CNVs on 16q24.2-3 and 
22ql3.31-33. 

Figure 1 shows the patterns of alterations in copy number loss 
observed in the 100 early-onset Japanese T2DM subjects and non- 
diabetic controls. Among diabetic subjects, 15 showed copy 
number losses in the 16q24.2-3 subtelomeric region, whereas only 
3 of the 100 non-diabetic control samples had copy number losses 
in this region (P= 5.19x10 — 3, OR = 5.7, 95% confidence interval 
1.6-20.4) (Figure lA). For the 22ql3. 31-33 subtelomeric regions, 
22 diabetic subjects showed copy number losses around the gap 
region, whereas only 6 of the 1 00 non-diabetic control samples 
had copy number losses in this region (P= 1.81 x 10 — 3, OR= 4.4, 
95% confidence interval 1.7-11.4) (Figure IB). We observed two 
copy number patterns, that is, "unchanged copy number" and 
"copy number loss", at these two regions in our diabetic and 



A 



B 




Figure 4. Details of the numbers of patients and controls with 
copy number loss in each region verified by the deCODE- 
lllumina CNV370K BeadChip system. Copy number losses were 
verified in 14 of the 100T2DIV1 patients in both 16q24.2-3 and 22q13.31- 
33, while only one of the 1 00 controls had copy number losses in these 
two regions. Copy number losses were verified in all three regions in 1 1 
of the 100 T2DM patients, while none of the 100 controls had copy 
number losses in all three regions. 
doi:1 0.1 371/journal.pone.0088602.g004 



control populations, while none of our subjects had detectable 
"copy number gain". Copy number loss was frequently observed 
at the 16q24.2-3 and 22ql3. 31-33 subtelomeric regions in early- 
onset T2DM subjects. 

To verify the CNV BeadChip results, we analyzed copy number 
changes along these two regions in the subtelomere using a high- 
density custom-made oligonucleotide tiling microarray. Structural 
details of the copy number losses in each of four representative 
T2DM patients with copy number losses in 16q24.2-3 and 
22ql 3.3 1-33 subtelomeric regions are shown in Figures 2A and 
2B, respectively. Individual copy number plots using moving 
average (y-axis) versus distance along the chromosome (x-axis) are 
presented. For comparison, copy number plots of healthy 
individuals who did not exhibit copy number alterations in the 
region are also shown ((e) and (f) of Figures 2A and 2B). Using this 
procedure, we examined copy number losses in the 16q24.2-3 and 
22ql3. 31-33 subtelomeric regions in 10 of 15 and 12 of 22 
subjects, respectively, in whom copy number losses in each region 
had been detected by means of the first genome-wide screening 
using the deCODE-IUumina CNV370K BeadChip system. Copy 
number losses were confirmed in all 22 subjects examined 
(Figures 3A and 3B). In addition, the high-density tiling 
microarray further verified these copy number losses to actually 
be segmental losses. Thus, the first screening results were 
confirmed by the more detailed procedure. 

As we carried out these analyses, we recognized that the CNVs 
were often detected in the same subjects. For instance, all 10 
subjects who were confirmed to have the copy number loss in 
16q24.2-3 using the high-density tiling microarray analysis 
simultaneously had the copy number loss in 22ql3. 31-33. 
Therefore, we calculated the frequencies of these two CNVs as 
well as the one on 4pl6.3 [32] in all 100 diabetic and 100 non- 
diabetic subjects, according to the results obtained using the 
deCODE-IUumina CNV BeadChip system. As shown in 
Figure 4A, all diabetic subjects with copy number loss on 4pl6.3 
or 16q24.2-3 had another CNV(s). In addition, 15 of the 22 
diabetics with copy number loss on 22ql3. 31-33 simultaneously 
had another CNV(s). Intriguingly, 11 of the 100 T2DM subjects 
had copy number losses in all three subtelomeric regions (Figures 4 
A), while none of the 100 controls had copy number losses in all 
three regions (Figure 4B) (P— 7.00 x 10 * by Fisher's exact test). In 
contrast, of the study subjects with copy number losses in two of 
the three subtelomeric regions, 5 were diabetic (Figures 4 A), while 
2 were controls (Figure 4B), with the difference not reaching 
statistical significance (P= 4.45x10 '). Thus, simultaneous copy 
number losses in these three different chromosomes are clearly 
associated with T2DM development and may become a novel 
DNA marker, usuful at least for detecting subjects likely to have 
early-onset T2DM. 

We next examined the clinical characteristics of T2DM 
associated with combined copy number losses. Sex and onset 
age did not differ between patients with copy number losses in all 
three subtelomeric regions (n = 11) and those with no copy number 
losses in all three regions (n = 89). All 11 patients with copy 
number losses in all three subtelomeric regions had a family 
history of T2DM within third-degree relatives, supporting the 
notion of combined copy number losses being a genetic factor 
conferring high susceptibility to T2DM development. However, 
maximal body mass index, HbAlc and postprandial plasma 
glucose levels did not differ between these two groups (Table 2). In 
addition, urinary C -peptide reactivity levels and ratios of receiving 
insulin therapy were similar (Table 2), suggesting that patients with 
copy number losses in all three regions do not have specific clinical 
features. Therefore, it is possible that these combined copy 
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Table 2. Characteristics of T2DIV1 patients with and without CNVs in all three regions based on analysis using the deCODE-lllumina 
CNV BeadChip system. 







Patients with copy number losses 
in all three regions (n = 1 1 ) 


Patients without copy number 
losses In all three regions (n = 89) 


P 


male 


6/11 


53/89 


0.75 


Age of onset 


21.1 ±8.9 


22.3±7.9 


0.42 


Family history of T2DM 


Yes 


11/11 


83/89 


0.37 


Maximum body mass index (kg/m^) 


28.8±5.3 


27.9±3.9 


0.31 


HbAlc (NGSP)(%) 


8.3±2.8 


8.4±2.3 


0.54 


Postprandial plasma glucose (mg/dl) 


169.2 ±68.9 


174.2 ±76.2 


0.63 


Use of insulin injection therapy 


Yes 


9/11 


54/89 


0.74 


Urine C-peptide reactivity level (^ig/day) 


67.8±88.3 


59.9±91.2 


0.29 



Onset age, maximum body mass index, postprandial plasma glucose levels, family history, HbAlc levels, and urinary C-peptide reactivity levels did not differ betw/een 
the 1 1 patients with copy number losses in all three regions and the other 89 patients. 
doi:l 0.1 371 /journal.pone.0088602.t002 



number losses would often be detectable in early-onset T2DM 
patients. Thus, detection of these copy number losses may be a 
novel and useful diagnostic measure for predicting the early onset 
of T2DM. 

Discussion 

This study revealed two novel copy number losses in the 
subtelomeric regions in 16q24.2-3 and 22ql3. 31-33. Our initial 
genome-wide screening with the deCODE-IUumina CNV370K 
BeadChip system for association with early-onset T2DM revealed 
novel losses in the subtelomeric regions of both 16q24.2-3 and 
22ql3. 31-33. Subsequent high-density custom-made oligonucleo- 
tide tiling microarray analysis verified copy number losses in these 
regions. Furthermore, taken together with the results of our 
previous study [32], simultaneous copy number losses in three 
(4pl6.3, 16q24.2-3 and 22ql3.31-33) subtelomeric regions were 
detected in 11 of 100 T2DM patients, while none of the 100 
controls had copy number losses in all three subtelomeric regions 
(Figures 4). This result suggests that simultaneous copy number 
losses in these three subtelomeric regions constitute a potentially 
novel and useful strategy, with high accuracy and selectivity, for 
predicting susceptibility to early-onset T2DM. 

The CNV regions found in 4pl6.3, 16q24.2-3 and 22ql3. 31-33 
contain a number of genes (Figure 3) which might be involved in 
T2DM development. For example, genes involved in the glucose- 
induced insulin secretion cascade of pancreatic beta-cells, such as 
ATP5I (ATP synthase, H+ transporting, mitochondrial FO 
complex, subunit E) [38], CPLXl (complexin 1) [20] and OAK 
(cyclin G associated kinase) in association with CDK5 (cyclin- 
dependent protein kinases 5), are located in 4pl6.3 [39,40]. 
PPARa (peroxisome proliferator-activated receptor alpha) located 
in 22ql 3.3 1-33 is reportedly involved in T2DM onset and 
progression [32,41]. Therefore, there is a possibility that these 
and other as yet unknown genes are responsible for T2DM 
pathogenesis through their impairments caused by copy number 
losses. However, these genes were not necessarily all deleted in the 
1 1 patients who had copy number losses in all three subtelomeric 
regions (Figure 2). In addition, we also discovered marked 
accumulation of CNVs in the same subjects with T2DM, as 
compared with non-diabetic controls. In particular, 1 1 of our 
T2DM subjects harbored all three subtelomeric copy number 



losses, whereas none of the controls did. These results suggest that 
copy number losses are not a direct cause of T2DM development. 
Instead, copy number losses per se appear to have been induced 
by another major mechanism(s) which is also involved in T2DM 
development. Therefore, a putative cause for CNVs, rather than a 
single gene defect in the CNV regions, is involved in the onset and 
progression of T2DM. 

As shown in Table 1 , the patients with all three copy number 
losses did not exhibit specific clinical characteristics of T2DM. 
Body weights, insulin secretion and insulin therapy ratios did 
not differ from those of other early-onset T2DM patients. 
Therefore, as yet, we cannot distinguish these patients with 
CNVs based on their clinical findings. In other words, these 
patients may be identified among the entire population of 
early-onset T2DM patients, and possibly even in the late-onset 
T2DM population as well. More studies are needed to 
elucidate the rate of CNVs in T2DM subjects, in general, by 
examining these CNVs in larger populations. Since DNA 
samples were collected from individuals living in northeastern 
Japan, local factors should be taken into consideration. 
However, there are no apparent locality differences between 
our T2DM subjects and the non-diabetic controls. In addition, 
neither common nor other CNVs tended to be shared between 
the patient and control groups (data not shown). These results 
collectively indicate the combined CNVs to be strongly 
associated with T2DM, at least in subjects living in northeast- 
ern Japan. Considering that genomes uniquely correlating with 
T2DM have been found in East Asian countries like Japan, 
Korea and China [30], the CNVs in these three regions might 
be related to racial inherited factors. In addition, because of 
the small sample size, we cannot rule out the possibility that 
the contribution of CNVs to T2DM development might be 
overestimated. Further intensive studies with a larger popula- 
tion are needed to clarify these issues, including generality, 
local factors and pathogenesis. 

In conclusion, copy number losses within multiple subtelomeric 
regions are strongly associated with early-onset T2DM. Examin- 
ing the CNVs in three regions may be a useful strategy for 
predicting T2DM as well as possibly providing insights into its 
pathogenesis. 
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